Rating for sub-properties using latent topics

نویسنده

  • Madhavi Yenugula
چکیده

Tripadvisor is a travel website which provides user generated reviews for travel-related content. I used the Tripadvisor dataset available here here. The dataset has a total of 1621956 reviews about 12,773 hotels with an average of 126 reviews for each hotel. Hotel’s information provided includes its name, location, price etc. A review for a hotel includes the author’s location, date, rating and review text. The rating aspect of Tripadvisor dataset is particularly interesting as it provides ratings for other aspects/sub-properties of the hotel in addition to the overall rating. This information is helpful for a customer to take an informed decision about his choice based on his specific preferences. From the number of ratings per month, we can see that there are particular months during which people take vacation the region from Aug-December. It is consistent with the general knowledge that people take vacation during holidays. This trend is observed during all the years from 2003 to 2012 from 2. It can also be seen that the number of people using/rating places on Tripadvisor increased almost exponentially over the years.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Probabilistic Multi-Topic Matrix Factorization for Rating Prediction

Recently, Local Matrix Factorization (LMF) [Lee et al., 2013] has been shown to be more effective than traditional matrix factorization for rating prediction. The core idea for LMF is to first partition the original matrix into several smaller submatrices, further exploit local structures of submatrices for better low-rank approximation. Various clustering-based methods with heuristic extension...

متن کامل

Joint Author Sentiment Topic Model

Traditional works in sentiment analysis and aspect rating prediction do not take author preferences and writing style into account during rating prediction of reviews. In this work, we introduce Joint Author Sentiment Topic Model (JAST), a generative process of writing a review by an author. Authors have different topic preferences, ‘emotional’ attachment to topics, writing style based on the d...

متن کامل

Semantic Text Segmentation and Sub-topic Extraction

Semantic Text segmentation and sub-topic extraction divides the input text into coherent paragraphs and extracts topics out of them. This enables applications to extract relevant meaningful data that could be useful in many text analysis tasks like information retrieval and summarization. In this project we have combined the techniques of text tiling and latent semantic analysis and have come u...

متن کامل

Rating Prediction with Topic Gradient Descent Method for Matrix Factorization in Recommendation

In many online review sites or social media, the users are encouraged to assign a numeric rating and write a textual review as feedback to each product that they have bought. Based on users’ history of feedbacks, recommender systems predict how they assesses the unpurchased products to further discover the ones that they may like and buy in future. A traditional approach to predict the unknown ...

متن کامل

A Comprehensive Study on a Latent Heat Thermal Energy Storage System and its Feasible Applications in Greenhouses

Abstract Energy crisis is a major challenge in the current world. Latent heat thermal energy storage (LHTES) systems are known as equipment with promising performance by which thermal energy can be recovered. In the present study a comprehensive theoretical and experimental investigation is performed on a LHTES system containing PEG1000 as phase change material (PCM). Discussed topics can be ca...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015